Unsupervised Modeling of Topical Relevance in L2 Learner Text
نویسندگان
چکیده
The automated scoring of second-language (L2) learner text along various writing dimensions is an increasingly active research area. In this paper, we focus on determining the topical relevance of an essay to the prompt that elicited it. Given the burden involved in manually assigning scores for use in training supervised prompt-relevance models, we develop unsupervised models and show that they correlate well with human judgements. We show that expanding prompts using topically-related words, via pseudo-relevance modelling, is beneficial and outperforms other distributional techniques. Finally, we incorporate our prompt-relevance models into a supervised essay scoring system that predicts a holistic score and show that it improves its performance.
منابع مشابه
Measuring Interlanguage: Native Language Identification with L1-influence Metrics
The task of native language (L1) identification suffers from a relative paucity of useful training corpora, and standard within-corpus evaluation is often problematic due to topic bias. In this paper, we introduce a method for L1 identification in second language (L2) texts that relies only on much more plentiful L1 data, rather than the L2 texts that are traditionally used for training. In par...
متن کاملL2 Learners’ Strategy Preference in Metaphorical Test Performance: Effects of Working Memory and Cognitive Style
Although investigating the factors that influence test scores is important, a majority of stakeholders show a paucity of attention towards individual learner differences due to having large classes of L2 learners. This study sought to explore the possible effect of working memory and cognitive style on L2 learners’ metaphorical test performance. The study was conducted in 2 phases. The first ph...
متن کاملLearning First-Order Horn Clauses from Web Text
Even the entire Web corpus does not explicitly answer all questions, yet inference can uncover many implicit answers. But where do inference rules come from? This paper investigates the problem of learning inference rules from Web text in an unsupervised, domain-independent manner. The SHERLOCK system, described herein, is a first-order learner that acquires over 30,000 Horn clauses from Web te...
متن کاملSelf-Regulation, Goal Orientation, Tolerance of Ambiguity and Autonomy as Predictors of Iranian EFL learners’ Second Language Achievement: A Structural Equation Modeling Approach
The identification of the cognitive, affective, social and even physiological factors affecting second or foreign language learning routes and rate has for long been a challenging aspiration for second language researchers. However, a recent preoccupation of the researchers in this area has been the study of the combinatorial impacts of such factors on second or foreign language learning proces...
متن کاملStarting a Sentence in L2 German - Discourse Annotation of a Learner Corpus
Learner corpora consist of texts produced by second language (L2) learners. I We present ALeS Ko, a learner corpus of Chinese L2 learners of German and discuss the multi-layer annotation of the left sentence periphery notably the Vorfeld.
متن کامل